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Objectives 


The principal objective of this unit is to establish some general important 
results which will form the basis of applications to be considered in later 
units. 


After working through this unit, you should be able to: 
(i) explain the meanings of the terms: 
dimension of a vector space, 
morphism from one vector space to another, 
kernel of a morphism, 
matrix, 
identity matrix ; 
(ii) understand the proofs of the theorems about morphisms of vector 
spaces which are listed in section 23.1.6; 
(iii) add and multiply (suitable) matrices ; 
(iv) form a scalar multiple of a matrix. 


Note 


Before working through this correspondence text, make sure you have 
read the general introduction to the mathematics course in the Study 
Guide, as this explains the philosophy underlying the whole course. 
You should also be familiar with the section which explains how a text 
is constructed and the meanings attached to the stars and other symbols 
in the margin, as this will help you to find your way through the text. 
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Structural Diagram 


‘ Vector Spaces 
1 Unit 22 


Mappings between 
Vector Spaces 
23.1.1 


= 
fey 
ss] 
= 
z 
3 
a 


Morphisms 
23.1,2 


e 
' 

! Simultaneous Kernel of a 
| Equations Morphism 

H R.B.5 23.1.3, 23.1.4 
1 


23.2.1 


An Algebra of 
Morphisms 
23.1.5 


Matrices 
23,2,2 


Matrix 
Multiplication 
23.2.3 


Addition and Scalar, 
Multiplication of 
Matrices 

23.2.5 


Linear Equations 
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Solution of Linear 
Equations 
23.1.4 


Linear Algebra 
Units 26 and 28 


Geometrical 
Applications 
23.2.4 


Glossary 


Terms which are defined in this glossary are printed in CAPITALS, 


EQUALITY OF 
MATRICES, 


IDENTITY MATRIX 


INVARIANT ELEMENT 


INVARIANT SET 


KERNEL 


LINEAR EQUATION 


LINEAR 
TRANSFORMATION 


MATRIX (mm X 71) 


MATRIX ADDITION 


MATRIX 
MULTIPLICATION 


MATRIX 
SUBTRACTION 


MORPHISM 


MULTIPLICATION OF 
A MATRIX BY A 
SCALAR 


NULL SPACE 


SUBSPACE 


Two MATRICES are EQUAL if they have the same 
numbers of rows and columns, and the correspond- 
ing elements are the same. 


The IDENTITY MATRIX is a MATRIX with the same 
number of rows as columns, in which all the 
elements on the diagonal from the top left-hand 
corner to the bottom right-hand corner are 1’s, 
and all other elements are 0's. 


An INVARIANT ELEMENT under a function f is an 
element a of the domain of f such that f(a) = a. 


An INVARIANT SET under a function f is a subset 4 
of the domain of fsuch that (A) = A. 


The KERNEL of a MORPHISM is the set of elements 
in the domain which map to the zero element in 
the codomain of the morphism. 


A LINEAR EQUATION is an equation of the form 
GyXy + GgX2 + +++ + aX, = 0, 
where the a’s are constants. 


A LINEAR TRANSFORMATION is a MORPHISM from one 
vector space to another. 


AMATRIX(m x n)isarectangulararray of numbers, 
having m rows and n columns. 


See page 43. 
See page 37, 


See page 43. 


If L is a mapping from a vector space V to a vector 
space U such that 


Lay, + Mab.) = %,L(v4) + o2L(v2), 


for any #,, a) € R, vy, v2 € V, then L is a MORPHISM 
from V to U. 


See page 43. 


The NULL SPACE of a MORPHISM is the KERNEL of the 
morphism. 


If S is a subset of a vector space U, and S is itself a 
vector space, then S is a suBspace of U. 


42 


28 


43 


Notation 
bik 


R" 


Ry 


he 


Geometric vectors of length one in the directions of 
the x, y and z Cartesian axes respectively. 


The set of all n-tuples of real numbers, ie. the 
Cartesian product, 
RxXRx+-x*R, 
eS 
terms 
The function f, considered as an element of a 
vector space, 


A temporary symbol for the operation of addition 
on a vector space V. 


The zero element of the vector space V. 
(1,.0,..5510) 
(0, 1,...,0) 


These n-tuples form a basis for R". 
(Qi Oss.) 
A matrix with two rows and three columns, 


A temporary symbol for the operation of multi- 
plication of two matrices A and B (in that order) 
where B has just one column, 

A temporary symbol for the operation of matrix 
multiplication. 


The matrix corresponding to the mapping which 
rotates the plane about the origin through an angle 
6 anti-clockwise, i.e. 
e 0 —sind 
sin 0 cos 0) 
The identity matrix. 


A temporary symbol for the operation of matrix 
addition, 
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H. S. Wilf, Calculus and Linear Algebra (Harcourt, Brace and World 
1966). 

This book is easy to read and, as the title suggests, may be a help in the 
Calculus as well as Linear Algebra. The material relevant to this unit is 
covered in Chapter 6; we shall cover the first part of this chapter later in 
the course, so section 6.1 may be omitted at this stage. 


F. Loonstra, Introduction to Algebra (McGraw-Hill 1967). 

Written in a different style to the first book, this text is a more formal 
introduction to Algebra. It is very clearly written. The relevant chapter is 
again Chapter 6, 


S. Lang, Introduction to Linear Algebra (Addison-Wesley 1970). 

The approach is similar to ours in that vector spaces are introduced by 
way of geometrical vectors. The order in which matrices and morphisms 
are discussed is slightly different from ours, but nevertheless Chapters 4 
and 5 cover the material of this unit. 
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23.1 MAPPINGS OF VECTOR SPACES 24 
23.1.0 Introduction 23.1.0 
In Unit 22, Linear Algebra I we considered the set of geometric vectors. Introduction 


and defined the terms linear dependence and independence, dimension 
and basis, We then extended our use of these terms from a geometric 
context to an algebraic one: we defined a vector space. The examples of 
vector spaces given in Unit 22 are interesting because, although they look 
very different, they all belong under the same umbrella. However, just 
looking at examples does not give us any extra equipment for solving 
problems. In this unit we shall establish some general results for vector 
spaces, 


You will recall that in Unit / we began by mapping sets one to another, 
and we introduced the concept of a function. With this concept we found 
that we could think about problems which are more sophisticated and 
have wider application than simple arithmetic calculations. It is the same 
with vector spaces; the topic becomes richer and more interesting when 
we introduce mappings from one vector space to another. 


23.1.1 Mapping One Vector Space to Another 23.1 


Some mappings of vector spaces to vector spaces are simply equivalent 
to a change of notation, 


Example | Example 1 


We know that the set of geometric vectors forms a vector space, as does 
the set of all lists with three elements, 


Ify = xi + yj + 2k, then the mapping 


with domain the set of geometric vectors, simply gives us an alternative 
notation, 


(x,y, 2) 
LG xi+ yitzk 


(i, j and k are the unit geometric vectors in the directions of the x, y and z 
Cartesian axes respectively: see Unit 22, Linear Algebra I.) 


Under this mapping we have 


1 
no 
\0/ 


0 

n:k-—10 

1 
We can calculate n (y) for any geometric vector x once we know the images 
of the base vectors. a 


Notice that if we want a mapping to define a new notation for the elements 
in its domain then the mapping must be one-one. 


Another “notational” mapping is the mapping of ordered lists to ordered 
pairs defined by 


x! 
Jon 
y 


or ordered lists to ordered triples defined by 


oe 


Y | (X52), 
Zz 


which simply states the obvious fact that, given a co-ordinate system, 
we can represent a two-element list by a point in a plane and a three- 
element list by a point in three-dimensional space. We already have a 
name for the set of all ordered pairs of real numbers; we call it R x R. 
This is usually shortened to R?. R° is used to denote the set of all ordered 
triples of real numbers, and R" is used to denote the set of all ordered 
n-tuples of real numbers. We have seen that R", with addition and mul- 
tiplication by a scalar defined in the usual way, is a vector space of dimen- 
sion n, We shall refer to this vector space as R", leaving the operations to 
be understood, 


Example 2 
Consider the mapping of R? to R* defined by 
f(x, y) — (2x, 2y). 


What effect does this mapping have? One way to start to answer this 
question is to consider what happens to a few particular elements. 


Take the element (1, 1); then f(1, 1) = (2, 2). 


2 (2,2) 
44 6 (1) 
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Example 2 


For the element (0, 1), we have f(0, 1) = (0, 2). 


(on (0,2) 


In general, we see that the point P is mapped on to the point P’ in the 
same direction away from the origin O, where OP’ = 20P. (Does any 
point remain unchanged?) 


Consider a set of elements in the plane, such as 
{(x, y):x? + y? = 1). 


These points lie on a circle of unit radius, and therefore their images 
under this mapping will lie on a circle of radius 2. We can verify this 
algebraically as follows. Suppose (x, y)+— (u, v), then u = 2x and v = 2y, 
and so if x and y satisfy the equation 


?+y=1, 
u and v must satisfy the equation 


by += 


ie. w+vr=4 
Thus 
{(x, y)ix? + y? = 1}-— {(u, v):u? + 0? = 4} 
= {(x,y)ix? + y? = 4} 


(We have re-written this set in terms of x and y so that we can draw the 
following diagram.) 


y y 
. x ; { } x 
a 
Example 3 


Consider the mapping of R? to R? defined by 
(x,y) (—y, x). 


Let us have a look at what happens to the base vectors (1,0) and (0, 1). 
We have 


(1,0) (0, 1) and = (0, 1) + (— 1, 0). 
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(See RB4) 


Example 3 


The mapping has the effect of rotating the base vectors through an angle 


n . os ean 
3 counter-clockwise about the origin, and this is indeed the effect on the 


y y 
Ae or) 
ie 


entire plane. 


A 
NI 


In this example, any circle centred at the origin maps onto itself. Every 
point of the set {(x,)):x? + y* = 1} moves (for example, (3, $)—>(—#,3)), 
but the set itself remains unchanged. a 


Example 3 provides a particular example of a simple but important 
mathematical concept. Given a function f of a set A to itself, then if, 
for instance, 


f(a)=a (ae A), 
we say that a is an invariant element under f. 


If the set S as a whole maps to itself, then S is called an invariant set under f, 
even if some or all of the elements of the subset S of A are moved under /. 
In the above example we have an invariant circle, i.e. an invariant set of 
points. The concept of invariance can be even more general. For example, 
under a translation of the plane, the distance between two points is 
invariant; that is, if P and Q map to P’ and Q’ respectively, then the length 
of PQ = length of P’Q’. Invariance is a very important and general 
concept in mathematics, and we shall have more to say about it in the 
remainder of this course and subsequently. In fact, we encountered 
invariance very early in the Foundation Course. When we are asked to 
solve an equation f(x) = 0 and we choose to do it iteratively, then we 
usually rearrange the equation in the form: 


x = F(x); 
that is, we look for the invariant elements under the function F. (See 
Unit 2, Errors and Accuracy.) 
Exercise 1 
Under the mapping 

f(x, y)'— (2x, 2y), 
in Example 2, we have seen that circles centred at the origin are not 
invariant. But some lines are invariant. Which lines? a 
Example 4 
Consider the mapping of R? to R? defined by 

(x,y) (—y, y). 


From the previous two examples, it seems that, to see the effect of the 
mapping, it is probably worth seeing what happens to a set of base vectors. 
Again, to simplify the arithmetic, we choose the base vectors (1,0) and 
(0, 1). We find that 


(1,0) (0, 0) 
(0, 1)—(-1, 
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Definition 1 


Detinition 2 


Exercise 1 
(1 minute) 


Example 4 


so that in terms of the corresponding geometric vectors i and i. we have 
that j maps to —{ + j, but i maps to the zero vector. 


t bf rel 
ae 
(io) x 
+ y y 
@) 


LE 


We shall look at this mapping in a little more detail. What happens to the 
x-axis? On this axis y is zero and so every point on the x-axis maps to 
(0,0): the entire x-axis shrinks into the origin. What about the line 
y = 1? Every point on this line maps to the point (—1, 1). 


¥ 
, 


In fact, the entire plane maps on to the line whose equation is 


x+y=0 


In terms of geometric vectors, every element in the image set can be 
represented in terms of the vector —{ + j. 


The image set has dimension 1, and so the effect of the mapping is to 
“lose” a dimension from our vector space. This is equivalent to saying 
that this mapping is many-one. If we start with a point, P say, in the plane 
and map it to a point Q on the line x + y = 0, then we cannot map back 
to the original point P. This is because the point Q on the line x + y = 0 
corresponds to the whole of the line parallel to the x-axis through P; this 
line is the equivalence class containing the element P. (In Unit 19, Rela- 
tions we showed that any many-one mapping defines an equivalence 
relation on its domain. This is an example. The relation p is (a, b) p (c,d) 
ifb=d.) a 
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(continued on page 6) 


Solution 1 
Straight lines which pass through the origin. a 


(continued from page 5) 


We have chosen these geometric examples to give you a visualization of 
the sort of mappings we are going to consider. One of the pleasant features 
of linear algebra is that by considering a geometric situation we can often 
throw light on non-geometric situations (and vice versa). Thus, a non- 
geometric analogue of the last case, where we “lost” a dimension, is 
provided by the following example, 


Example 5 
Let P; be the vector space of all polynomial functions of degree 3 or less. 
(This space has dimension 4, and a set of base vectors is given below.) 
The differentiation operator: 

D:p+— Dp (pe Ps) 


maps the space P, to the space P; (which has dimension 3). In this case 
we could take as a basis for P, the set of functions: 


foix-— 1 (xe R) 
<—— xe R) 
fick 7 eee Basis for P, 
exe (xe R) 
i (xe R). 


This set of functions maps to the set 
Loix-—0 (xe R) 
fix (xe R) 
[ix 2x (xe R) Basis for P; 
Lyx? (xe R). 


Like the previous example, this mapping is many-one. (For example, 
all the functions of the form: 


fix? ta (xe R), 
where ae R, map to the function /.) 


Note that /' is the zero vector in P,. It cannot belong to any basis for P,, 
since a basis must be a linearly independent set of three vectors. For 
consider the set { /,g, 4}, where g,h € P,. Since 


af + Og + 0b = fo, 
where « is any non-zero real number, we see that any set of three elements 
containing f is linearly dependent. a 


Exercise 2 


We seem to be pinning a lot of faith in choosing a convenient basis. Is the 
choice of basis important? We shall resolve this difficulty later, but one 
point can be considered here. 


In Example 5, instead of f, and f,, we could choose go and g,, where 
Sox — 1 + & (xe R) 
Bice — Fd eX, (xe R), 


and then none of the base vectors maps to the zero vector under D. Is 
{go. 81, fs} a basis for P,? Py 
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Solution 1 


Discussion 


Example 5 


Exercise 2 
(2 minutes) 


Exercise 3 


Fill in the gaps in the solution to the following problem. 


Problem 
If Tis a mapping from R? to R?, and 


Tes.— (23), 


describe the image of the ellipse 
{(x, y):9x? + 4y? = 36}. 


Solution of Problem 


Put u = 3 and v = 5 
Since x and y satisfy the equation 
9x? + 4y? = 36 


wand v satisfy the equation 


ie, 


{(x, y):9x? + 4y? = 36}-—> 


The image set corresponds to a circle centred at 


Exercise 4 


and with radius 


= 


(i) 


(ii) 


(iii) 


(i) By choosing a suitable basis and finding its image, or otherwise, 


describe the effect of the following mappings of R? to R?. 


(a) T, :(x, y)-—>(y, x) 


(b) T, :(x, y)#— 


ae ae 


(ii) Find an element in R? which is unchanged under the mapping T,. i 
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Exercise 3 
(2 minutes) 


Exercise 4 
(4 minutes) 
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Solution 2 Solution 2 
Zo:x-— 1 (xe R) 
ext (xe R). 
Although none of the base vectors is mapped to the zero vector, 
{8.81 £3} is linearly dependent, since 
Igo + ae, + Of3 = fo; 
to 


o and gi, cannot both belong to the same basis because one is a scalar 
multiple of the other. a 


Solution 3 Geilo 
(i) 9(2u)? + 40)? = 36 

(ii) we +0? =1 

(iii) {(u, v):u? + v? = 1} 

(iv) the origin; | 


Thus the effect of the mapping is to transform the ellipse to the circle as (See RB4) 
shown below. 


y 
3 y 
1 
Ce > ae 
a 


Solution 4 Solution 4 


(i) If we choose the basis {(1, 0), (0, 1)}, we have: 
(a) T,:(1,0)-—+ (0, 1) 
T, :(0, 1) —>(1, 0) 


y y 
{ocy): xy} 
{o) {o1) 
(jo) * (10) 


The effect of T; is to reflect the points of the plane in the line with 
equation y = x (we are simply interchanging the x and y co- 
ordinates). 


yt 
b) Z:(1, 0)'—+ |—=, —= 
(b) T,:(1, 0) (; 5 
ee 
:0,0—(—, } 


(on) ae | aw 


The effect of T; is to rotate the plane about the origin through an 
angle i (If you are not convinced, find the images of some more 


points.) 

(ii) Any scalar multiple of (1, 1) is an invariant element under T,. In terms 
of geometric vectors, any vector in the direction of the line with 
equation x = y is invariant. All such geometric vectors are scalar 
multiples of { + j. a 


23.1.2 Morphisms 


There are many interesting and useful results concerning mappings of 
vector spaces, but the most fruitful field of study consists of those mappings 
which are homomorphisms or isomorphisms (and therefore, of necessity, 
functions). 


If Vis a vector space, the two operations we have defined on the elements 
of V are addition of vectors and multiplication of a vector by a scalar. 
Suppose a function T maps a vector space V, with an addition operation 
+,, to a vector space U, with an addition operation +,. The additive 
structure will be preserved if, for any vectors x, and p, in V, 


T(vy +yb2) = TWvy) +, T(v2) 


Since we have been abusing the symbol + throughout this unit (we have 
defined all sorts of methods of addition and called them all +), we shall 
continue to do so, and we drop the suffices u and v from the addition 
symbols, We then have 


T(vy + Wa) = T(es) + T(va) 
as the condition that T should be a morphism for the addition operations. 
For the other operation we require that 

T(av) = xT(v), 
for any real number « and any vector pe V. 


Equations (1) and (2) are the conditions that T should be a morphism 
from the vector space V to the vector space U.* 


The two equations can be combined to give the following equation : 
T(ayvy + 02) = a, T(v,) + #2 T (v2), 
for any real numbers «,,«, and any vectors v, and p, € V. 


In many books, a mapping of a vector space to a vector space is called a 
transformation, and when the mapping is a morphism it is called a linear 
transformation. This is another example of calling a particular type of 
mapping by a special name. (We have already used the word operator.) 


* Previously (see Unit 3, Operations and Morphisms, page 25), we said that a morphism f was a 
mapping ftom A to f(A). Here, T(V) may be a proper subset of U. (In fact, if Tis a morphism, 
T(V) is a vector space, but we have not proved this yet.) 
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231.2 


‘Equation (1) 


Equation (2) 


Equation (3) 


Definition 1 


Exercise 1 


Which of the following mappings are morphisms? (Take the operations 
in the various vector spaces to be the usual ones.) 


(i) The mapping of R? to R? such that 
T:(%1, 2)" (2, x1) 

(ii) The mapping of R? to R? such that 
T(x, X2)+—— (xf, x3) 


(iii) The mapping of the set of all geometric vectors in a plane to R such 
that 


T:x'—4:x, 


where q is a given geometric vector, and the dot stands for the inner 
product as defined in Unit 22. 

(iv) The mapping of the set of all polynomial functions of degree n or less 
to itself such that 


T :p-—the derived function of p. 


(v) The mapping of the set of all real functions, which are twice-differ- 
entiable at all points in R, to the set of all real functions, such that 


Tf —> 2D4f + Df + 3f a 


Exercise 2 
Let L be a morphism from a vector space V to a vector space U, 


Complete the gaps in the proof of the following theorem. 


THEOREM. 


If the zero element of V is uo (i.e. Up is the element for which v + vo = v 
for any ve V), and if up is the zero element of U, then L(v9) = uo. 


PROOF 


Since p+ =v 


L(v + Uo) = if (a) 


But L is a morphism, so 


L(v + Uo) = 1 + u } (b) 


iy de 
From (and (ii), L(v) + L(vo) = Lv), 


so subtracting L(v) from both sides, we see that 


L(vo) is the element of U. (c) 


Confirm this result for each of the mappings which are morphisms in 
Exercise 1. a 


Exercise 2 shows that under a morphism the zero element in the domain 
vector space is mapped to the zero element in the codomain vector space. 
We shall use this result in proving the following theorem, which is funda- 
mental to the study of morphisms of vector spaces. 
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Exercise 1 
(3 minutes) 


Exercise 2 
(4 minutes) 


Main Text 


THEOREM 


If Lis a morphism from a vector space V to a vector space U, then L(V) 
is a subset of U which is itself'a vector space. 


(L(V) may be U or a proper subset of U.) 


LW) 


METHOD OF PROOF 


We have to prove that the set L(V), with the operations of the vector space 
U, satisfies the vector space axioms listed in Unit 22. To save you the 
trouble of referring back, we list the axioms for a vector space V; v, 0), 
2, U3 are any elements of V, and «and f are any real numbers: 

v) + v2 € V and is unique. 

Uy + (v2 + 3) = (Yr + L2) + B3 

Ry +22 = 22+ 

There is an element, vp, in V, such that 


ene 


Ut =vL 


5 weV 

6 w+ (—lp=to 

7 oft + v2) = (Ly) + (a2) 
8 («+ Be = av + Bo 

9 (aP)v = a(Bv) 
Wlxv=ev 


Axioms 2, 3, 6, 7.8.9 and 10 are statements about all elements of a vector 
space, and since U is a vector space we do not have to check these axioms 
for L(V). On the other hand, axiom 4 is a statement that a particular 
kind of element (the zero element) belongs to a vector space. Clearly, the 
zero element of U will not belong to every subset of U,so we have to prove 
that it belongs to the particular subset L(V). Axioms | and 5 concern 
closure. If L(V) is to be a vector space, then any combination of elements 
in L(V) must give resulting elements still in L(V), This again is not 
necessarily true for any subset of U, so we must check it for L(V). 


PROOF 


We have to prove that axioms 1, 4 and 5 hold for L(V). We have three 
pieces of information: 

(i) V is a vector space; 

(ii) U is a vector space; 
(iii) L is a morphism. 
We have used (ii) to dispose of axioms 2, 3, 6, 7, 8, 9 and 10, but we have not 
yet used (i) and (iii). 


We have proved that axiom 4 holds for L(V): in Exercise 2 we proved 
that under a morphism the zero element vp in the domain V maps to the 
zero element uy in the codomain U. So the zero element of U belongs to 
LV). 


Let us have a look at axiom 1; we must show that L(V) is closed under 
addition. If u, and u, are any elements of L(V), then there are elements 


Main Text 


(continued on page 13) 
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Solution 1 Solution 1 
(i) T((x15%2) + (V1+¥2)) = Ther + Yas %2 + Ya) 
= (X2 + Ya.X1 + 1) 
= (2,4) + (Yas ¥1) 
= T(x,,X2) + Ti, 92) 
and 


T(a(X, .X2)) = T(ax,, 4X2) 


= (0X2, 0X4) 
= o(X2,%4) 
= aT (x,,X2), 
so T is a morphism, 
(ii) @T(Xy , X2) = ox},.X3) = (ax7, x3), 


and 
T(x, ,X2)) = T(axy , 2) = (0°x7, 7x3) 


Equation (2) is not satisfied, so T is not a morphism. 


(iii) Tix + y)=a-(x + y) 
=4q:X +q-y(- is distributive over +) 
= T(x) + Tly) 
and 
Tag) = q+ ox 
= aq x) 
= aT(x) 


so T is a morphism. 
(iv) T is a morphism | These results follow directly from the properties 
(v) T isa morphism { of D. a 


Solution 2 Solution 2 


(a) v 

(b) 2, Lo 

(c) zero 

For the morphisms of Exercise 1, we have: 
(i) (0, 0)-— (0, 0) 

(iii) Q-— 0 

(iv) («+ 0) —> (x 0) 

(v) (x-—+ 0) (x + 0) 


You may have noticed that for many-one mappings, the zero vector is 
not the only vector which maps to the zero vector. For example, in (iv) 
we also have (x+—> k) mapping to (x-—+ 0), where k is any real number. 
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v, and v, in V such that (continued from page 11) 
uy = L{v,), 
4, = L{v,). 

Then 


My + Mp = Ley) + L(v2) 
= L(v, + v2) (because L is a morphism) 
= Lvs), 


where v, is an element of V (by axiom 1 for the vector space V); L(v,) is 
an element of L(V), so u; + uz belongs to L(V), and L(V) is closed. 


The other closure axiom, number 5, is easily checked. 


Lv), 


= 
i] 


then 
aL{v) 
L(av) (because L is a morphism). 


Therefore aue L(V). 
This completes the proof, 


& 
) 


If a subset of a vector space U is itself a vector space, then we call it a 
vector subspace of U. Definition 2 


Summary ‘Summary 


So far in the text, we have considered mappings of one vector space to 
another, and we have concentrated our attention on those mappings 
which are morphisms. A morphism has the property that the image set 
itself is a vector space: we have seen that the properties of commutativity, 
associativity and the zero element are carried over from the domain to its 
image set by a morphism. 


An interesting feature of some of the morphisms we have met is that they 
map a vector space on to an image set which has a lower dimension. For 
example, we have had mappings of planes to lines, polynomials of degree 
nor less to polynomials of degree n — | or less, and so on. Two questions 
arise. What has happened to the “lost” dimensions? Can we predict 
in advance when we are going to “‘lose’’ a dimension? We shall look at 
these questions in the next section, 


Exercise 3 Exercise 3 
(3 minutes) 


(i) The mapping from R? to R? defined by 
Li(xy, Xz)" (—x2,X2) 


is a morphism. Prove directly by verifying the axioms that L(R?) is 
a vector space. 
(ii) The mapping from R? to R? defined by 


T 3(x,,X2)-— (x, X2) 


is not a morphism. Show that T(R?) is not a vector space by finding 
an axiom which is not satisfied. 


(Note that we have not proved that if T:V-— U is not a morphism, 
then T(V) is not a vector space. For a general mapping T, we do not 
know anything about T(!’) if T is not a morphism.) a 


Solution 3 


(i) As in the proof of the theorem, we need to check axioms |, 4 and 5 
var J The elements of L(R?) are of the form 
(~a,a),a€ R, 
and 
(—a,a) + (—b, b) = (—(a + b), a + b), 
and so axiom 1 is satisfied. 
Axiom 4 Consider the element (0, 0), 
L((0, 0) = (0, 0), 
so that (0,0) € L(R?), and axiom 4 is satisfied. 
Axiom 5 (—a,a) is any element of L(R?) 
o —a, a) = (— aa, 2a), 


and so o(—a,a)€ L(R*): axiom 5 is satisfied, 
(ii) The only axioms which may not be satisfied are 1, 4 and 5, Of these 
only 5 is not satisfied, 


ox], X2) = (AT, 1X 2) 


and if « is negative, «xj is also negative, and so cannot be written as 
the square of a real number. a 


23.1.3 The Kernel 


Let us have another look at Example 23.1.1.4 in which we saw that the 
morphism 


L(x, y)—(—y,y) (Gy) eR?) 


maps the plane to a line, First, we looked at a particular basis, and saw 
that one of the base vectors mapped to the zero element (0, 0) in the codo- 
main, We then investigated the mapping by looking to see what happened 
to particular subsets of the plane. We saw that any line parallel to the 
x-axis mapped to a single point. 


This raises two questions. Firstly, is it significant that we lose one basis 
vector and we lose one dimension? Secondly, it is all very well in this 
simple case to pick out a few significant subsets that tell us such a lot. 
We picked them out because we knew their properties. Consider now the 
images of the lines parallel to the y-axis in the domain. Any such line 
maps to the entire image set, for suppose we take the line for which x = a, 
then 


L:(a, y)#—>+(—y.y) (ve R). 


y o 
ae \—e Se 
x x 
By considering certain subsets of the domain, we find that we can obtain 
information about L. Is there any particular subset which we can most 
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Solution 3 


profitably consider? That is, can we describe a subset in the domain 
which will give us information about L in a form which we can interpret 
easily? If so, can we extract any general feature which will help us with 
other examples? 


The clue is in our observation about the loss of a basis vector. The vector 
(1, 0) maps to (0, 0), but of course it is not only this vector which “shrinks” 
to zero—every multiple of (1,0) maps to (0, 0). (Because the mapping is a 
morphism, L(o(1,0)) = «L(1,0) = (0,0).) So a whole set maps to (0,0). 
Why consider this particular set? Try the next exercise. 


Exercise 1 


The vectors (0, 1) and (2,2) form a basis for R?. Calculate L(0, 1) and 
L(2, 2), where L is the mapping we have been discussing: 


Li(x,y)—(— yy) (ye R?). 
What happens to the linear independence of (0, 1) and (2, 2)? a 


Exercise | shows us that, in our original basis, the choice of a vector 
which mapped to (0,0) was purely fortuitous. (See page 4,) It may so 
happen, as in this exercise, that none of the base vectors maps to (0, 0), 
even though we “‘lose’’ a dimension. But here the whole set {(%, 0):%¢ R} 
maps to (0, 0), whether or not one of its elements is in the basis. 


It seems, then, that the set which maps to (0, 0) tells us something about 
the “lost” dimension. In this case the set which maps to (0, 0) has dimen- 
sion | (every element of the set can be obtained as a scalar multiple of 
(1, 0)), and we lose just one dimension. Let us have a look at two more 
examples, one where we again lose one dimension and one where we lose 
more than one dimension. We shall again take geometrical examples 
because geometrical situations are easy to visualize 


Example | 
The mapping 
L(x, y, 2) (% ¥, 0) 


is a morphism of R* to R*. The image of any point P is the point at the 
foot of the perpendicular from P to the plane with equation z = 0. Thus 
the domain maps to a plane, 


P(xy,z) 


(xy,0) 


This morphism maps a 3-dimensional space to a 2-dimensional space: 
we lose one dimension. The set which maps to the zero element (0, 0, 0) 
in the codomain is the set {(0,0, z):z€ R}, that is, the z-axis. This set is 
itself a vector space and its dimension is one, the same number as the 
number of “lost” dimensions. a 
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Exercise 1 
(1 minute) 


Discussion 


Example 1 


(continued on page 16) 
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Solution 1 Solution 1 
LO, 1) = (—1, 1) ¥ (0,0) 
and 
Z (2,2) = (—2, 2) # (0,0), 
but 


—2L(0, 1) + L(2, 2) = (0,0) 


Although neither vector maps to zero, the pair of linearly independent 
vectors maps to a pair of dependent vectors. So although the original 
vectors form a basis for R?, their images do not, i | 


(continued from page 15) 
Example 2 Example 2 
The mapping 

L(x, y, z)-—+(x, 2x, 0) 


is a morphism of R* to R*. The image of the point (x, y, z) depends only 
on its x-co-ordinate. Thus (1, 2, 3), (1, 6, 7), (1, 6, 99) all map to the point 
(1, 2,0), Every point in the plane with equation x = 1 maps to this point. 


F z 

gz. ‘ ‘ 
y (1,2,0) 

m 


x 


Similarly, every point on the plane with equation x = 2 maps to the point 
(2, 4,0)-and so on, Every plane perpendicular to the x-axis maps to a 
point on the line defined by the equations: 


2x-y=0 
z=0, 


and the entire three-dimensional space maps to this complete line. 


Thus 
LR*) = {(x, y, 2):2x — y = 0,2 = 0} 


The three-dimensional domain maps to a space of dimension 1. We 
seem to have lost two dimensions. 


Which set maps to the zero element? In this case the zero element is 
(0,0, 0), and the set which maps to (0,0,0) is the set {(x, y,z):x = 0}, 
i. the yz-plane; this is itself a vector space and has dimension two. 
Notice that in this simple case we can use the “basis argument” again —if 
we can find an appropriate basis. Taking the set of vectors {(1,0,0), 
(0, 1, 0), (0,0, 1)} as a basis, we see that both (0, 1,0) and (0,0, 1) map to 
(0, 0,0), and so we “lose” two base vectors. In fact we “lose” any vector 
which can be expressed as a linear combination of these two vectors 
(ie. the points in the plane with equation x = 0), because 


L(a(0, 1,0) + B(0, 0, 1)) 
= aL{0, 1,0) + BLO, 0, 1) = (0,0, 0), 
since L is a morphism. a 


We have seen that the subset of the domain which maps to the zero element 
in the codomain plays an important part, so we now give it a name. 


If Lis a morphism of a vector space V to a vector space U, and if uy is the 
zero clement in U, then the set 


{vive V, L(v) = uo} 
is called the kernel* of the morphism. (Another name which is incommon 


use for this set is the null space.) We shall denote the kernel by the letter 
K. 


There is one important point to notice here. In Unit 22, Linear Algebra I, 
we defined a basis of a vector space V to be a linearly independent set of 
vectors in V which spans V. We defined the dimension of V to be the num- 
ber of elements in a basis. Now the kernel of a morphism may only contain 
the zero element of V; i.e. we may have K = {0}. 


We write Q instead of up here, because you may like to refer to our dis- 
cussion of the vector space {Q} in section 22,2.2 of Unit 22. There, we proved 
that 


@ = 0, 

where « is any real number. This means that {0} is a linearly dependent set; 
that is, {0} does not possess a basis. We adopt the following definition. 
The dimension of the zero vector space, {0}, is zero. 
The kernel has some quite remarkable properties. We have already 
hinted at two of them which are printed in red below. 
(1) The kernel itself is a vector space. 

We have shown that L(V) is itself a vector space, and in Examples | 


and 2 we have seen that 
(2) (dimension of L(V)) = (dimension of V) — (dimension of kernel) 


The proof of (2) is given in the Appendix at the back of this text. The proof 
of (1) is not difficult ; we have set it as an exercise. 


* This is in accordance with the ordinary use of kernel for the nucleus or core of a structure. 
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Definition 1 


Nagin 


Definition 2 


Exercise 2 


Find the kernel of each of the following morphisms: 
(i) T (1X2, 3) (x1, X2,0) 

where T maps R® to R*. 
(ii) Ts, X2)-—> (& + X2,X4 — 2X9) 

where T maps R? to R?. 


In each case find the dimension of the kernel and verify statement 2 in 
the text. a 


Exercise 3 
Let L be a morphism from a vector space V to a vector space U. Show 
that the kernel of L is a vector subspace of V. 


(HINT: How many of the axioms of a vector space need proving for the 
kernel? (See section 23.1.2.)) a 


23.1.4 Some Properties of the Kernel 


It is remarkable how much we can tell about a morphism just by consider- 
ing its kernel, 


Let L be a morphism of a vector space V to a vector space U, and let K 
be its kernel. If ke K and ve V, then 


L(v + k) = L{v) + L(k) (Lis a morphism) 
= Lv) + uo (definition of K) 
= Liv) (axiom 4 for U), 


where uy is the zero element in U. So v and v + k, where k is any element 
of the kernel, have the same image, 

Suppose now that we want to find all the elements in V which map to a 
given element ye U, and that we know one such element p, i.e. 


Lv) =u 
Then we know immediately that vp + k, for all ke K, are such elements, 
and the remarkable thing is that they are in fact all the elements which 


map to u. We can prove this as follows. Suppose y, € V maps to u, ic, 
L(v,) = u. Then consider vp, + (—1)v. We have 


L(v, + (= Ye) = L(vy) + (— Lv) (Lisa morphism) 
=u+(—lu (by hypothesis) 
=Uo (axiom 6 for U). 
But the kernel K contains all those elements which map to up, so 
vy +(—le=k 


for some k, € K, By axiom | for V, ky is unique. Adding v to both sides, we 
get 


vy =vu+kh, 


and so Pp, is of the form v + some element of the kernel. This result has 
important consequences: for instance, if the kernel contains a finite 
number of elements, n say, then we know that exactly n elements of V 
map to any given element of L(V). In particular, if the kernel contains 
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Exercise 3 
(4 minutes) 
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just one element (which will necessarily be the zero element in V), then we 
know immediately that L is one-one, i.e. an isomorphism. The following 


examples-apply this discussion. 

Nims gyal 
Example | Example 1 
Apply the above ideas to finding the solution of the equations 


in terms of vector spaces. a 


Solution of Example 1 


One way of expressing the problem is to say that we want to find the set 
of triples (x, y, z) Which satisfy these equations. If L is the mapping from 
R® to R? defined by 


L3(x, y, 2) (2x + 3y — 2.x + y — 2), 
then we want to find the set which maps to (1, 2). 


We have seen that we can describe the set which maps to any particular 
element when we know the kernel and one element of the set. In the con- 
text of this example, this means that, if we can find one solution to the 
equations, we shall be able to find all the solutions, simply by adding to 
that solution each element of the kernel. So we have to find one solution 
and we have to find the kernel. 


One Solution 


If we give x or y or za particular value, then the equations will be reduced 
to two equations in two unknowns, which we can solve easily, 


For example, if we put z = 0, then we obtain the two equations 


2x + By = 
x+y=2 
which we can solve to give 


x=5, y=-3. 


So one solution of the original equations 
1 


2x + 3y — 


x+y-z= 


=0. (continued on page 21) 
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Solution 23.1.3.2 Solution 23.1.3.2 


(i) {(0,0,x,):x3€R}. Any element in this set is a scalar multiple of 
(0, 0, 1). 

(ii) {(x,,X2):x, + x2 = 0 and x, — 2x, = 0} 

The pair of simultaneous equations 


xy +x, =0 
x, — 2x, =0 
has the single solution x, = 0, x, = 0. Thus the kernel is the set {(0, 0)}. 


The dimensions of the kernels are as follows: 
(i) 1, The dimension of the domain is 3, the dimension of its image set 
is 2,and3 —2=1. 
(ii) 0, The dimension of both the domain and its image set is 2. Note that, 
by defining the dimension of {0} to be zero, we have ensured that (2) is 
satisfied when K = {0}. a 
Solution 23.1,3.3 Solution 23.1.3.3 


We need only prove that the elements of the kernel satisfy axioms 1, 
4 and 5. As usual, we denote the kernel by K, 


Axiom 1 If k, and ky belong to K, we want to prove that k, + k, eK. 
We recognize elements of K by the fact that under L they map to uo, 
the zero clement in U. 


Lk, + ko) = L(k,) + L(k2) (Lis a morphism) 
= Uo + Uo (ky, ka €K) 
= Uy (axiom 4 for U) 
Therefore, k, + ke K. 


Axiom 4 We have already shown (Exercise 23.1.2.2) that L(vo) = uo. 
Therefore vy € K. 


Axiom 5 Let ke K, then 


Lak) = aL(k) (L is a morphism) 
= Uy (ke K) 
= Uo 


This last step has been proved in Unit 22, Linear Algebra I, p. 40. The 
verification of axioms | and 5 can be amalgamated by using the definition 
of a morphism in the form given in Equation 23.1.2.3. | 


20 


The Kernel 


The kernel, K, is the set of triples (x, y, z) which map to (0,0), ie. which 
satisfy the equations 


2x + 3y — 


etsy 


Just as before, we can solve these equations by giving x or y or za particular 
value and then trying to solve the resulting two equations in two un- 
knowns; but this time it is not much help, because we simply get the one 
solution, and we want all solutions. But if, for example, we give z a general 
value and put z = k, then these equations become 


2x + 3y=k 


x+y=k, 
which we can solve to give 
x = 2k, y=-k. 


So one element of the kernel is (2k, —k, k), and by varying k we get all the 
elements. So K is the set 


{(2k, —k, k):k © R}. 


(5,-3,0) _ 


We want to find the red shaded set in the above diagram. 
We know one element — how do we find the others’? 


Any solution of the original equations 
2x+3y-z=1 
x+y—-z=2 
is obtained by adding an element of the kernel to (5, — 3, 0). 
So the complete solution set is 
{(5 + 2k, —3 —k, k):ke R}, 


and the theory we have developed assures us that this set contains all 
the possible solutions to the original equations. 


(Check that these are solutions by substituting into the original equations.) 
Notice that we can solve related problems like 


2x +3y-z=7 
x+y—2=99, 


where the right-hand sides of the equations are changed, very quickly — 
all we need to do is to find one particular solution and then add on each 
of the elements of the (same) kernel. We shall discuss problems of this 
type in considerable detail in Unit 26, Linear Algebra III. a 
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(contined from page 19) 


Exercise 1 
By putting x = 0, find a particular solution of the equations 
2x+ 3y —2=2 
x+y+z= 
Find the solution set of the equations. | 


Exercise 2 


We can map the vector space P,, of all polynomial functions of degree 
nor less, to itself by the differentiation operator : 


D:p-— p's (pe P,). 


We have already seen that this mapping is a morphism. What is the kernel? 
What significance does this have in integration? a 


Problems such as the following often arise in mathematics. We are given 
a mapping, M say, from a set A to a set B, and are required to find all the 
elements x of A such that 


M:x-—>b, 


where b is a given element of B. That is to say, we have to solve the equa- 
tion M(x) = b. Sometimes there is no solution (if b does not belong to 
M(A)); sometimes there is a unique solution; sometimes there are many 
solutions. In a very wide class of problems, A and B are vector spaces and 
Misa morphism. We have just seen that in these cases we can (in principle) 
adopt a standard procedure. We first find the kernel, K, that is, the solu- 
tion set of 


M(x) = 9. 


(This is usually a much easier problem than the original one.) We then 
find any one particular solution of 


M(x) = b, 


and then combine this with each element of K, to get the complete solution 
set. 


We shall apply these techniques later in the course, and we shall also 
meet the idea of a kernel in contexts other than that of a vector space. 
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Exercise 2 
(3 minutes) 


23.1.5 An Algebra of Morphisms 


We shall now use some of our earlier work on functions. In Unit 1, 
Functions we saw various ways in which functions could be combined. 
In particular, we saw that there are two distinct types of combination: 
composition, or combination by successive performance, and the “arith- 
metic” combinations which can be “induced” from operations defined 
in the codomain. In Unit J, we gave particular examples of this latter 
type. We saw that when the codomain is a set of real numbers we can use 
the operations of addition, multiplication and so on, to define addition of 
functions, multiplication of functions, and so on. 


If T, and T, are functions mapping a vector space V to a vector space U, 
then we have two vector space operations defined on U which can be 
used to define new functions of V to U; 


T, + Ty:u-—> T,(v) + Th(v) (veV) 
“T, :u'-— o( T,(v)) (eV) 


where « is a real number, 


Exercise | 

Show that, if 7, and T; are morphisms, then T, + T, and xT, are both 
morphisms. (If you find that you need to look up the solutions to this 
exercise, then do so, but the solutions will have little value if you only 


read them, because they are cluttered with symbols. So once you have 
read them, try writing out at least one part on your own.) a 


Exercise 2 


Given two vector spaces V and U, we can consider the set of all morphisms 
from V to U, It is a remarkable fact that this set of functions with the 
operations we have just defined is itself a vector space. Prove this state- 
ment. 


(You will need to check each of the axioms of a vector space. They are 
listed on page 11.) a 


Finally, we shall show* that, if T, is a morphism of the vector space V 
to the vector space U, and T; is a morphism of U to the vector space W, 
then T, e T, isa morphism of V to W. With the usual notation, we have 


Tye Ty(0,0) + %qv2) = Tr(Ti(ot,0, + a2) 


= Tyo, T(vs) + 2% Ty(v2)) 
(since T, is a morphism) 


= % T(T,(v,)) + % T(T;(v2)) 
(since T, is a morphism) 


= mT, © T\(v1) + %2T3 © T\(v2) 
ie. Equation 23.1.2.3 holds, so T, © T, is a morphism. 


* What we prove here is a special case of tite general result that the composition of two 
morphisms is a morphism. The only reason we prove it here is because we have not proved 
it before in the course : we could have proved it earlier in Unit 3, Operations and Morphisms. 
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Solution 23.1.4.1 - 
Putting x = 0 in both equations, gives 
3y-z= 
y+tz=1, 


which have the solution 


Thus one solution is 
x=Oy=9,z=4. 
To find the kernel, we have to solve the equations 
2x +3y-—z= 
x+y+z=0 
If we put x = k, we get 
3y —z = —2k 
y+z=-k, 
which have the solution 
y= hz = —tk; 
so the kernel is the set 
{(k, —3k, —4k):ke R}. 
The complete solution is therefore 
{(k,3 — 2k4— 4k)ke Rh. 
(You may recall from Unit 15, Differentiation II that the equations 


2x + 3y —z=2 and x + y+2z=1 are equations of planes. The set 
of points 


{(k,3 — 3k,4 — 4k):ke R} 
corresponds to the line formed by the intersection of these two planes.) 


Solution 23.1.4.2 


The kernel is the set of all polynomial functions which map to the zero 
function (x-—> 0(x € R)); that is, the set of all constant functions, 


{f:xt—>k (xe R), ke R}. 

The problem of integration is that of solving equations of the form 
Dip) = f, 

where f is given. 


Since the kernel contains an infinite number of elements, the integration 
problem has an infinite number of solutions. If p is one solution, then the 
set of all solutions is 


(p+ Sif:x-—+ kB R), ke R}, 7 
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Solution 23.1.4.1 


Solution 23.1.4.2 


Solution 1 
Let T= 7, + Th. 
Tu + ve) =(T, + Th)(u + v) 


W 


T,(u + v) + Tau + v) (definition of T, + T,) 


T,(u) + T,(v) + TW) + T,(v) 
(7, and T, are morphisms) 


(Tw) + T,(w)) + (T\(v) + Tv) (axioms 2 and 3) 
=(T, + T,)(u) + (T, + T2)(v) (definition of T, + T) 
= Tu) + T(v) 


Thus the first condition for a morphism is satisfied. Now let « be any real 
number. 


T(au) = (T, + T,)(ou) 


= Ty(au) + T,(ou) (definition of T, + Ty) 

= aT,(u) + #Th(u) (T, and T, are morphisms) 

= oT, + T)(u) (axiom 7 and definition of T, + Ty) 
= oT(u) 


Thus the second condition is also satisfied: T is a morphism. (The proof 
could be shortened by using the definition of a morphism given by 
Equation 23.1.2.3, and showing that 


T(a Uy + a2) = %,T(uy) + %2T(Ua).) 
We also have to prove that «7; is a morphism. We use Equation 23.1.2.3. 
ATi (2,Uy + pM) = ox Ty (Wy + %2M2)) (by definition of «T,) 
= ow, T,(U;) + %T,(u2)) (7, is amorphism) 
= (%)T,(u,) + (aa2)T,(u2) (axioms 7 and 9) 
= 04(@T;)(s) + (27, )(u2) (axiom 9) 


as required. a 


Solution 2 


Closure under addition and multiplication by a scalar have been proved 
in Exercise | (axioms 1 and 5), 


The zero mapping 
O:v-—*ug (vee V) 
belongs to the set because it is a morphism of V to U (axiom 4). 


The other axioms may be investigated directly from the definition of 
addition of functions and the definition of multiplication of a function 
by a scalar. 


It is important in this proof to be very clear about the equality of two 
functions; two functions are equal if they have the same domain and the 
same image for each element of the domain. & 
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Solution 2 


23.1.6 Summary 


We have concentrated our attention on those mappings of vector spaces 
which are morphisms. Under a morphism the image of a vector space is 
itself a vector space. A particularly important subset of the domain of 
such morphisms is the set of elements which map to the zero vector in the 
codomain. We call this set of elements the kernel of the morphism, and 
it has some remarkable properties. It is itself a vector space ; its dimension 
tells us the difference between the dimensions of the domain and the 
image space; it tells us whether the morphism is many-one or one-one; 
it provides us with a method of solving equations such as L(v) = u, 
where L is a morphism of a vector space. If L is a homomorphism then 
it is, by definition, a many-one mapping, and so we can expect this equa- 
tion to have more than one solution. 


We saw that if we know the kernel of the morphism and if we know one 
solution of the equation L(v) = u, then we can easily generate every 
other solution by using the elements of the kernel. We shall see later in 
this course how this method has direct application to the solution of 
problems in the theory of linear equations, in complex numbers and in 
differential equations, and how an analogous method can be developed 
in group theory. We also showed that, given two vector spaces V and U, 
the set of all morphisms from V to U is itself a vector space. Finally, we 
showed that the composition of two (suitably detined) morphisms of 
vector spaces is itself a morphism. 


These results are listed below. 


Let L be a morphism from a vector space V to a vector space U. Then, 
(i) L(V) is a vector subspace of U; 
(ii) K = {ke V:L(k) = uo} is a vector subspace of V, called the kernel 
of the morphism ; 
(iii) (dimension of V) = (dimension of L(V)) + (dimension of K); 
(iv) if v, is a solution of L(y) = (y), then the set of all solutions of this 
equation is {v, + k:ke K}: 
(v) K = {vo} <* L is an isomorphism; 
(vi) the set of all morphisms from V to U is a vector space ; 
(vii) if Ly is a morphism from U to a vector space W, then L, o L is a 
morphism from V to W. 


23.2 MATRICES 
23.2.0 Introduction 


In this second part of the text we introduce a new idea — that of a matrix. 
A matrix is simply a rectangular (or square) array of numbers such as 
you might see on a bus timetable or on the board outside a cinema, 
showing the starting times of the films. Such arrays of numbers arise 
when we try to quantify almost any investigation. 


For example, the following two arrays might represent a survey of the 
voting inclinations in two constituencies. 


Party X Party Y Party Z Don't Know 
Men 30% 35% 10% 25% 
Women 2% 50%, 15% 10%, 

Party X Party Y Party Z Don't Know 
Men 50% 40%, 5% 3% 
Women 40% 40%, 2% 18% 


There are various ways of combining arrays. In this example, adding 
corresponding entries and dividing by 2 to give 


Party X Party Y Party Z Don't Know 
Men 40% 374% 14% 15% 
Women 324% 45% 83% 14% 


might be a meaningful thing to do, to give a sort of “average” voting 
inclination. 

In the first section of this second part of the text we show how morphisms 
between vector spaces can be represented by matrices. This correspond- 
ence between mappings and matrices leads us to define some ways of 
combining matrices. We may want to define special combinations for 


special applications, but the ones we define here are the most widely 
used, 


(There is a lot of algebraic and numerical manipulation in section 23.2. 
If you are unfamiliar with matrices, then it will not help you much just 
to read through the calculations, We advise you to work through all the 
calculations as you come to them, and just use the manipulations given 
in the text as a check for your own.) 


23.2.1 Linear Equations 


In the first part of this text we discussed several examples of mappings 
from R* to R? or R* to R®, and we saw in section 23.1.4 that these mappings 
are connected with simultaneous equations. It is fairly clear that three 
equations, each expressing one of x/, x5 and x4 in terms of x,, x, and x5, 
will tell us how to map (x,, x2, x) to (x4, x4, x4). If these equations hold 
for all real values of x,, x), x3, then they will define a mapping from R* 
to R®. For example, the three equations 


x4 = sin x, 
x} = sin x2 
x4 = sin x3 
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Equations (1) 


define such a mapping, as do the equations 
X, =X, +%2+ X53 

xg = 2x, — X% 

x4 = 2x2 + 3x3 


where the right-hand sides are linear combinations of x,, x, and x3. 


Exercise 1 
Do Equations (1) define a morphism from R* to R°? 
Do Equations (2) define a morphism from R* to R*? r | 


It seems that any set of three equations of the form: 


X= ayX, + GgX2 + 45x; 


X= byxy + byxz + b3x3 
X4 = CyXy + CgX2 + C3X3 


where the a’s, b’s and c’s are real numbers, defines a morphism from R* 
to R®. A set of equations of this form is called a set of linear equations, and 
our suggestion can be put in more general terms as in the following 
proposition, 


Any set of m linear equations, each expressing one of m variables 
X4.X4,+...¥, in terms of n variables x1, X2,..+4Xq 

defines a morphism of R" to R”. 

The proposition is easily confirmed, for if the equations are 


X= ayXy + AgXy $8 + AX, 


W 


X= Dyxy + bgXq +++ + DyXy 


Xm = GiX1 + daXq + + WnXn 


they certainly define a mapping T of R" to R"; 


Pi[Xey5 F401 Xe) — > (Rives Xml 


For the variables x,, x2 ,»X, can each take any real value, and so 
,) can be any element of R". We have to prove that T is a 
that 


T(x, OXg p++ +5 OXp) = AT (% 1, X250 00s Xy) 
where « is any real number, and 
TX, + a9 Xa + Yas 0205 Xe + Ip) 


= T (Xp 1X 25060 Xq) FTV ps Vareeee 


Exercise 2 

Prove that T is a morphism by proving the last two statements in the 
text. a 
We thus have the result that every set of linear equations of the form of 
Equations (4) defines a morphism. 


This in itself would not be so interesting, if it were not for the fact that we 
can represent any morphism from R" to R™ by such a set of equations. 
We can do this as follows. 
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Equations (2) 


Exercise 1 
(2 minutes) 


Equations (4) 


Exercise 2 
(2 minutes) 


FM 23.2.1 


We take as a basis for R" the set of n vectors: Discussion 


€, = (1,0,0,0,...,0), 
B= 1074, 070) 1.105 Dk 
e; = (0,0, 1,0,...,0), 


ey = (0,0,0,0,..., 1). 


An arbitrary element of R" can be written: 


B= (Xp Xe Ngee ees Xa) = XyOy + NCQ + XC He + XC ne 


If Tis a morphism from R" to R™, then 


T(x) = Tigres + X22 + +++ + Xy@y) = XyT(e,) + -++ + XTC). 


We see that the base vectors tell us the whole story. Each of the base 
vectors has an image in R"; suppose 


Then 


then 


(The basis {g,, ¢2,. 


Tey) = (4, by 55--5 41) 
Tez) = (az, b2,..+542) 


TEC) = (Aye Byy «5 Gu) 


T(x) = = X,(ay,by,.--544) 
+ Xp(dz.b2, 266543) 


F Xp(Gys Day s+ a) 
= (GX + AgXz H+ + AyX yy DyXy Hee + DyXqye ede 


TS) 8 Te 4p Xai = (Ny Mosca mead 


T(x) = x4€, + X2€2 +++ + Xmen 


+£m} for R" is shown on page 41, written in terms 


of column vectors, instead of row vectors as we have here.) 


We have 
XY = GyXy + 2X2 + +++ + aX, 
x4 = bx, + b. Fe a} 
Xd 71% H+ bgxq + 00+ + OyXp Equations (5) 
Xm = GX, + GaX2 +++ + Xn AP 
anshhso” 


Thus, the morphism can be represented by Equations (5), 
fm 


mM 


There is a one-one correspondence between sets offlinear equations(and Theorem 


morphisms from R" to R”. Any set of linear equations of the form (5) 


defines a morphism from R" to R", and any morphism from R" to R™ 
defines a set of linear equations of the form (5). 


We mentioned earlier that a morphism between vector spaces is often Discussion 


called a linear transformation. We now see why — the transformation 


refers to the mapping part, and linear refers to the properties we have 
just discussed. It is from here also that we get the term Linear Algebra. 


In Unit 26, Linear Algebra 111, we shall take up the topic of sets of linear 
equations in much more detail. In the meantime we shall use the ideas 
we have just been discussing to introduce the topic of matrices, which is 
of great importance in Linear Algebra and mathematics generally, and 
which we shall use when we discuss linear equations in detail. 
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Solution 1 


Let x = (x,,X2,%3) and y = () V3). 


For Equations (1) we have the mapping S, where 
S(x) = (sin x,, sin x2, sin x4) 
S(y) = (sin y,, sin y2, sin y3) 
S(x + y) = (sin (x, + y,), Sin (x2 + Y2), sin (x; + Y3)) 
and 
S(x) + Sly) = (sin x, + sin y,, sin x, + sin y,, sin x, + sin 5) 
and so 
Six + y) # S(x) + SQ). 
For Equations (2) we have the mapping T, where 


T(x) = (x1 + X2 + X3,2%; — X2,2%2 + 3x3) 


and 
T(x + y) = [xp + yn +2 +2 + Xs + Vos 
x, + Yi) — (2 + Yad 
xz + Y2) + Mxy + Ys)| 
= T(x) + TY) 
Similarly 


T(ax) = aT (x) 


where « is any real number. 
Thus Equations (2) define a morphism, but Equations (1) do not. a 


Solution 2 
The first statement follows from the fact that, for example, 

Ay (4X1) + Ag(4X2) + +++ + A,(HX,) 

= 0(€,X, + GgX2 + +++ + ayX,). 

a process which can be applied to each of the equations. 
Similarly, we have, for example, 

Ay (Xp + V1) + Ag(X2 + Va) + 20> + A(X, + Yn) 

(GX + agX2 Fo + AyXy) + (Gi, + GaV2 + e+ + AnD) 


The second statement follows from an application of this process to each 
component of T(x, + ¥,,-+++% + Va: a 


23.2.2 Arrays of Numbers 


We have seen that Equations 23.2.1.5 define a mapping from R" to R™ 
(given bases for R" and R™). Now (x1, X2,++++X%,) aNd (X41, 4,-6.6, Xn) 
are simply arbitrary elements in the respective vector spaces ; the equations 
themselves are adequately specified by the array of numbers 


@ Gz *** ay 
By by eB 
1 42 + We 


Such an array of numbers is called a matrix. 


30 


Solution 2 


23.2.2 


Notation 


Often we want to talk about this array as an entity in itself, as opposed to 
a collection of elements. To do this we either put parentheses round the 
array or refer to it by a single letter. Sometimes we do both, thus we write 


fa, az --+ a\ = 
Aot by Ba Be m rows 
‘di 2 Qn} 
‘ime a T 
n columns 


Thus, a matrix with m rows and n columns defines a mapping from R" 
to R" (the bases being given or understood), and the mapping is generally 
referred to by the name of the matrix. 


Example 1 

(i) The mapping of R? to R? defined by 
x! = 2x + 3y 
y =3x-y 


is represented by the matrix 
2 3 
(; = i] 
(ii) The matrix 
eee te) 
( 0 : 
represents the mapping of R® to R* defined by 
xX, =x, + 2x, + 3x, 
x, = 4x, + Ox, + 6x5 | 
Two matrices are said to be equal if they represent the same mapping 
(with respect to bases which are given or understood). Thus matrices A 
and B are equal if they have the same number of rows and columns (they 
must represent mappings with the same domain and codomain), and the 


corresponding elements are equal. If A and B are equal, we write A = B. 
For example, 


Gd-6 4 
“ld 


6 8 
1 eae | 1 3 4! 
8 00 ‘ -( G7) ‘ 
23) Ait fh 2 sok 
( 00 ‘ 


wv 


Since a matrix is an alternative way of specifying a morphism, we should 
be able to tell something about the morphism just by looking at the matrix. 
Investigating the properties of matrices is equivalent to investigating 
morphisms between vector spaces. 
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Example 1 


Definition 2 


23.2.3 Combining Matrices 


We can represent a set of equations by a matrix, but we can also go one 
better and rewrite the equations in terms of this matrix. Consider, for 
example, the equations 


xX, = 3x, + 2x2 + Ix; 


xg = 1x, + Lx, + 3x; 

They specify a mapping from R* to R? under which 
(X41, %34X3)*——> (X44), 

and are represented by the matrix 
Fa (' = ‘ 

PhS 

We can write the equations in the following way: 

(x44) = Alxy,X2,%3), 


but there are plenty of other ways in which we could write them, We shall 
choose a way which leads to results consistent with the existing literature 
on matrices. We write 


s 
Xt 


(x4,.X5) as 


and 
xy 
(X45 X2.N3) AS |X ]s 
x3 
that is, we write the elements of R? and R® as arrays (or lists) of numbers, 
rather than in co-ordinate form. These lists are just special kinds of 


matrices, matrices with one column. So we can rewrite the equations in 
matrix notation as 


where ( is a combination of matrices which we have to define to make 
Equation (2) equivalent to Equations (1), 


[x1 


From our definition of equality of matrices, A [ |x, | must be a matrix 


X3/ 
with one column and two rows, and the first element must be x‘, and 
the second x3. But x‘, and x4 are given by Equations (1), so 


ce 
a 3x, + 2x, es 
x2|= 
is Xy + Xz + 3x5, 
N3 


Thus 


Ceo | Wal ta n  o 
Oi] xa] = 
Ix, + Lx, + 3x, 
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Equations (1) 


Equation (2) 


and for a general mapping from R* to R?: 


a bc 


Aji 


© Nda + ep + fy 


i|- ax + bB + cy 


The Jirst element in the third matrix comes from the first row of the first 
matrix, and is obtained by multiplying each element of that row with the 
corresponding element in the second matrix and adding together the 
products thus formed, The second element comes from the second row 
of the first matrix by applying the same procedure. 


Similarly, 


rect 
| te ) 2}=( lx 1l+2x(-2)+3x3 
23 - a] MH2) x 143 x (-2) + (—4) x 3 


i (7 


We can extend this operation to deal with longer lists and matrices with 
more rows and columns. This extension corresponds to manipulating 
equations specifying mappings from R” to R", where the column corres- 
ponding to an element of R" has n elements 


x lay az oe @, 


and the left-hand matrix is m rows 


Xn Ma, 8s, £4 Gy, 


This matrix must, of course, have the same number of columns as there 
are elements in the list matrix corresponding to an element of R", because 
of the form of the linear equations. 


Example | 

W(t) 23 1 Ix 1l+2x(-1)+3x2 5 
: 5 6/0 -)- [ext+sx(n+6x2]- -(1| 
ca 2 7x 148 x (-1)+9 x 2/ 17 


(ii) 


+ Jo) -(9 


(iii) The set of equations 


2x, + 3x, = 2 
Ix, — 2x, =3 
3x, + 1x, =4 


can be written in the form 


1 3 


‘Jal: -(} 
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Exercise 1 
Calculate 
"Ce ol of423 
2 0 

o Z5 (ii) ree Oo 
3.4 


one 


At the end of section 23.1.5, we proved that the composition of two 
morphisms, their sum, and the scalar multiple of a morphism are all 
morphisms. It follows that we can define combinations of matrices 
corresponding to each of these combinations of morphisms. Here we shall 
consider the combination which corresponds to composition. We shall 
consider the other cases in section 23.2.5. 


Consider, for example, the morphisms T, and T; of R? to R* defined by 
Ty (x1, X2)-—+ (1x, + 1x2, 2x, + 1x) 
Ta !(X1, X)—+ (1x, + 2x2, 3x, + 2x2) 

Under the composite mapping T, ° T, we have 


Ty © T,:(X1 5X2) TT (X), X2)) 


and 
T,(T,(X1, X2)) = Ty(1x, + 1x2, 2x, + 1x3) 
= (I(x + X2) + 22x, + X), AX, + Xq) 
+ 22x, + x2)) 
= (5x, + 3x3, 7x, + 5x3) 
Writing 
Tix, (x4.x4) and — Ty(x4.,.x4) = (x7, x3), 
we have: 
x, = Ix, + Ixy 
x4 = 2x, + Ix, 
and 


xf = Ix, + 2x4 
xy = 3x, + 2x5 
Substituting from Equations (3) into Equations (4), we get 
xf = 5x, + 3x2 
xg = 7x, + 5xz 
as before. 


In terms of matrices, the matrix representation for T, is 
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Exercise I 
(2 minutes) 


Discussion 


Equations (3) 


Equations (4) 


Equations (5) 


(see Equations (4)). And we have shown that the matrix representation for 
Tze T; is 


(see Equations (5)). 


If we use the symbol * to represent the operation of combining matrices 
which corresponds to the composition of mappings, we have 


A,*A, =A; 


Goad db J 
* = 
3 2) 2 1 7 5, 
We now know how to combine A, and 4, in this case, but how can we 


work out B* A for any matrices B and A? Indeed, does B * A make sense 
for any matrices B and A? 


First of all, how do we calculate B* A? Looking at Equation (6), we see 
that the element in the first row and first column of A, is 5, and 


S=1x1+2x2, 


and this expression is obtained by multiplying the elements in the first 
row of A, by the corresponding elements in the first column of A, and 
adding together the products, 


foil 


Combining rows of A, with columns of A, in this way, we get the other 
elements of Ag 


(i 


eee 


In the general case of matrices with two rows and two columns, we define * 


by 
‘ ‘) -(’ ‘ o c + fe eb aa 
g hl \c d) \gat+he gh + hdl 
Exercise 2 


xX = ax, + bx, 
X$ = cx, + dx, i) 
and 
XT = ext + fx 
” , , (ii) 
x5 = gx} + Ixy 


find x{ and x4 in terms of x, and x,, and compare your answer with the 
definition of *. P| 
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Equation (6) 


Exercise 2 
(2 minutes) 


(continued on page 36) 
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Solution 1 Solution 1 
(i) [5\, because 1 x 3 + (—1) x (-2)=5 
6] and 2x3+0 x (—2) =6 
1/ and 3x34+4x(-2) =1 
(ii) (14), because 1 x 1+2x2+3x3 =14 
(it) and (—1)x1+0x24+4x3=11 | | 
Solution 2 Solution 2 


Substituting from (i) into (ii), we get 
xT = e(ax, + bxz) + f(ex, + dx) 
XZ = g(ax, + bx2) + h(ex, + dx) 


x = (ea + fo)x, + (eb + fd)xz 
we (ga + he)x, + (gh + hd)xz 

and the matrix representing these equations is 
ea+ fe eb + fd 


ga+he gh + hd| a 
(continued from page 35) 
We can extend the definition of * to more general matrices. So far we have Discussion 


combined matrices with two rows and two columns — corresponding to 
combining a mapping from R? to R? with another mapping from R? to R?. 
It makes sense to combine mappings other than these; a morphism 7, 
from R” to R?, say, can be followed by a morphism 7; from R” to R”, say. 


We know that T, will be represented by a matrix with m columns and p 
rows, and T, will be represented by a matrix with p columns and n rows. 
So we can extend the definition of * to form B* A for all sorts of matrices 
provided the number of columns in B is the same as the number of rows in A. 
Since T; « T, isa morphism from R" to R", the combined matrix will have n 
rows and m columns, and in general B+ A will be a matrix with the same 
number of rows as B and the same number of columns as A. 


P m m 
columns columns columns’ 


nl Pp n 
rows TOWS TOWS 


We shall not follow through all the details, but illustrate how we might 
proceed in a simple example. 
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Example 2 
Wes 

IifB=|3 4] and a=(). 
26 i 


how do we calculate B* A? A represents the mapping from R! to R? 
specified by the equations 


x, = Ixy 
x4 = 2x) 


and B represents the mapping from R? to R® specified by the equations 
xt = Lx + 2x5 
xy = 3x, + 4x4 
x5 = Sx, + 6x4 
Substituting Equations (7) into Equations (8), we find that the composite 
mapping has matrix 
5 
BeA={11 
17 
Thisis a matrix with three rows and one column ; we expect this because the 


result of mapping R' to R? (matrix A) and then R? to R3 (matrix B) is a 
mapping from R'! to R° (matrix B * A). 


The elements in the matrix B* A are calculated by exactly the same 
method as before: the element 5 comes from the first row of B and the first 
(and only) column of 4:5 = 1 x 1 +2 x 2. The element 11 comes from 
the second row of Band the first (and only) column of A:11 =3 x 1+4 x 2, 
And 17=5x1+6x 2. a 


If you look back at page 32 where we discussed the operation 1, you will 
see that ( is just a special case of *. We can define * for general matrices 
(to correspond to the composition of morphisms) as follows. 


The combination B * A is defined for any matrices B and A, provided that 
the number of columns in B is the same as the number of rows in A. If 


B*A=C, 


then the element in row i and column j of C is obtained from row i of B 
and column jof A ; itis calculated by adding together the termwise products 
of the elements in this row and column, 
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Equations (7) 


Equations (8) 


Definition 2 


rowi 


Bix + Batts + Byts + +--+ + By ty 


If B has m rows and p columns and A has p rows and n columns, then 
C = B* A will have m rows and n columns. 


Example 3 
(Make sure you can obtain these results for yourself.) 
(i) [1 | iP es i 2 7) 6 
—| a -1 -2 Ij=([—4 —9 7 
rier av =r 
(fk 2 3 4 Sje ft 
1 
1 | =(15) 
1 
1 
(iii) t =2 1 3-4 
ih 3)" t—1 —2 1 Jis not defined, because 
2 4 1 2 3 
the number of columns in the left-hand matrix is not the same as the number 
of rows in the right-hand matrix. | 
Exercise 3 
(i) al 1 ; fs ( ce 
=1 3 2 1 
(ii) Calculate | 1 3 ‘} 1 1 
-1 2 =i, 2 if 
OY = 
(iii) Is * (a) commutative? 
(b) associative? a 
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Example 3 


Exercise 3 
(3 minutes) 


23.2.4 Some Special Matrices 


Rotation Matrices 


In the first part of this text we had several examples of morphisms specified 
by sets of equations, and we discussed their geometric interpretation. For 
example, we saw that the equations 


x = —X2 = Ox, + (—1)xy 


X= xXy=Ix,+ Ox, 


can be regarded as specifying a mapping which rotates the plane through 
an angle a anti-clockwise about the origin. In terms of matrices, we can 


0 
say that 


anti- 
1 


‘} represents a rotation of the plane through 


mia 


clockwise. 

Suppose we start the other way round and look for matrices to do some 
specified geometric job. 

Example | 


Consider a mapping which rotates the plane about the origin through an 
angle () anti-clockwise. 


Xo 


Xp 
(xy. x9) (4,3), a/ 
Xy 7 x} 


If we call the corresponding matrix Rp. then 
(2) - (a) 
xy x) 
where x). x», x4, and x’ are related as in the diagram. We can find R, easily 
1\ [0 
by considering the basis {{') (')} for R?. 


A little trigonometry tells us that 


1 ns 
sin 0. 


() ye 


Xe x2 


P’(cos 8,sin8) 


P10) > : 


x ” 


x» 
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Example 


(See RB2) 


(continued on page 40) 
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Solution 23.2.3.3 Solution 23.2.3.3 
(i) [7 1 
fd 
(ii) [7 2 
(} ’ 


(iii) (a) It is easy enough to find a counter-example to show that * is not 
commutative. For example, 


( el? = (° ‘} 
oo \1 of \o 0 
and 
Carl d-(i 
1o \o o ti o 
(b) This is more difficult, because * is in fact, associative. 


A direct proof for * could be tricky, but we can go about it another 
way. Remembering that * for matrices corresponds to composition 
of morphisms, which are just special kinds of functions, we can 
show that the composition of functions (where it is possible) is 
associative. Suppose A, B, C, D are sets and f,g, hare functions such 
that 


SiA— Bg: Bt — Ch: C- > D 
then if ae A, 
((heg)e f)(a) = (he g)(f(a)) 
= hig(f(a))) 
= hig ° f)(a)) 
= (he(ge f))(a) 


Thus, composition of functions is associative, which implies that 
* is also associative. sg 


(continued from page 39) 
(' —sin ‘) 
and| |-— 
1 cos 0, 


ur] xy 
ao) (-sin8,cos8)Q" 
xy > xy 


R, has two rows and two columns because the mapping is one-one and 
the image set of R? is R*. Thus if 


n=(" ‘ 
al 


we know that 


tall) ~ (al 


and 
(° ‘) -() _ {-sin ‘} 
ed 1 cos @) 
Since 
C=C) 
+[ |= 
ec dl \0 c 
and 
aC) 
+([ ]= 
ed 1 d. 
we find that 
a=cos(, ¢=sin@, b= —sin@andd = cos. 
Thus 


i 0 —sin ‘) 

*™ \sin 0 cos 0) a 

You may have noticed in this example that the columns of R, are precisely 
1 0 

the images of the base vectors (; and (’ under the mapping. This is not 


special to this example, for it is easily seen that if A is the matrix of a 
morphism T from R”, then the columns of A are precisely 


Tey, Tey, Teyy.ses Ten. 
where @,,€).@5..-.,¢,, are the particular base vectors 
1 0 0 
0 1 0 
€:=| 0 fe. =| 0 ],...@m = | 0 
0 0 1 


(Compare this with the discussion on page 29,) 


The subject of Linear Algebra is full of elegant yet simple results like this, 


Exercise 1 


Find the matrix which corresponds to a “stretching of the plane”, so that 
the distance of every point from the origin is multiplied by a number k. 


Exercise 2 

Find the matrix which corresponds to the identity mapping, which leaves 
every point in the plane unchanged. i | 
The Identity Matrix 


Under the operation of composition of functions, the identity function 
leaves every function unchanged. For example, the function: 


fxr—Fx (xe R) 


issuch that fe g = ge f = gforany function g with domain and codomain 
R. This property has its counterpart in matrix algebra. If we write 


1 0) 
l= (; as then 1* A = A*J = A for any matrix with two rows and 


4) 
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Exercise 1 
(1 minute) 


Exercise 2 
(1 minute) 


_ (continued on page 42) 


Solution 1 


“ 1 k 0 0) = Pere ual 
Since ie) and >|}, the matrix required is a | 
0, 0, if k 0k, 


Solution 2 
Pee bine) 
The matrix is 5 
ol 


(The special case of Solution 1 when k = 1.) a 


(continued from page 41) 


two columns. Indeed, if we write J,,,, for the matrix with m rows and m 
columns, such that every entry is zero except for 1’s as the diagonal entries 
from the top left-hand corner to the bottom right-hand corner, then if A is 
any matrix with m rows and n columns, we have 

Inm* A = A* Inn = A 


Usually, the suffices are dropped, and we just write J for the identity 
matrix appropriate to the particular situation. 


Exercise 3 (Optional) 


You may find the following interesting. 


(i) What is the result of rotating the plane anti-clockwise about the origin 
through an angle « and then through an angle p? 
(ii) Complete the following equation : 


Ry * Ry 


(iii) Complete the following equation 


cos (B + a) 


Ryse= 


(iv) Write down R, and Ry and calculate R, * R, directly. 
(v) Compare your answers to (iii) and (iv). Can you deduce anything from 
this comparison? 
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Solution 1 


Solution 2 


Identity 


Exercise 3 
(3 minutes) 


23.2.5 More Ways of Combi 


The operation * arose from our interpretation of the composition of 
morphisms in terms of matrices, and was made possible because we had 
established the correspondence between morphisms and matrices, 


ing Matrices 


There is an interesting result concerning morphisms which we obtained 
towards the end of section 23.1.5, and which we now want to take up again. 
We showed that the set of all morphisms from a given vector space V to a 
given vector space U itself formsa vector space. We know that, for example, 
any morphism from R” to R" is represented by a matrix with n rows and m 
columns, and conversely, that any such matrix represents a morphism 
from R™ to R". So the set of all matrices with n rows and m columns forms 
a vector space (for given n and m) for operations corresponding to those 
for morphisms, i.e. addition of functions and multiplication of a function 
byanumber. Itis fairly easy to see that the corresponding matrix operations 
are as follows: 
(i) Ifk is any real number, then kA is the matrix obtained by multiplying 
each element of A by k. 
(ii) If A and B are two matrices each with n rows and m columns, then 
A A Bis the matrix with n rows and m columns obtained by adding 
corresponding elements of A and B. 


Example | 


(i) 3/ 1 2 3 6 
-1 3)=|-3 9 
4 7 H <2 
(ii) 1 2 3 1 2+ 
-1 3/4 \=3 9]={(-1)+(-3) 3+ 
rey 12 Sefer: 4412 7+21 
4 8 
= |-4 12 
16 28 


ee ale yal 


is not defined: it is equivalent to trying to “add” two functions with 
different domains, a 


It is clear from the definition of A that it is both commutative and 
associative. Because of its obvious similarities to addition, we use the 
symbol + for A\; for example, we write 


(; ) | 2 ‘ | 4 ) 
a = 

1 4 ot Sead, =I 0) 

and we call the operation matrix addition. 


We can use these two operations to define a third operation — matrix 
subtraction — by 


A-B=A+(-1)B 
The matrix (—1)B is usually written as — B. 


So now we have three operations: *, multiplication by a scalar, and +. 
The two operations + and * are binary operations on sets of matrices, 
although we have to take some care over defining the sets in which they 
operate. We have noted some of their properties: + is commutative and 


B 


Definition 1 


Definition 2 


Example 1 


Definition 3 


Definition 4 


(continued on page 44) 


Solution 23.24.3 


If we perform two rotations such as these successively, the result is a 
rotation through an angle f + 9 which is of course the same as a rotation 
through a + B. Hence 
(i) A rotation through B + «, 
(ii) Ry * Ry = Rose 
(iii) 24 ks +a) —sin(p + = 

pa \sin(B +o) cos (B + a) 
heed —sin ‘) ie a —sin i. 

* 


sin B cos 


(iv) Ry * Ry = : 
sin % COS %) 


pomp —sinfsinw —cosfsin« — sin fcos a 


sin Bcosa+cosfsing —sin isin « + cos f cos x 
(v) Comparing (iii) and (iv), we obtain the formulas 


cos (f + x) = cos fcos x — sin f sin x 


sin (B + a) = sin Bcos a + cos f sin x a 


(continued from page 43) 


associative; * is associative but not commutative. We have not as yet 
seen how * and + interact. In particular, are they distributive over each 
other? 


Exercise | 


By choosing some simple matrices, prove that + is not distributive over *, 
ie. that for some suitable* matrices A, B and C 


A+(B*C)#(A + B)*(A + C) | 


It is a relatively simple matter to disprove a conjecture by a counter- 
example, as in Exercise 1, but to prove a conjecture we have to argue in 
general terms. We shall not do this here, because the proof does not 
illustrate anything very important (except, perhaps, a need for an improved 
notation, which will be developed later); however, we record the fact 
that * is distributive over +. That is, for suitable matrices A, B and C, 


A*(B + C) =(A* B) + (A*C) 
and 
(A + B)*C =(A*C) + (B*C) 
Notice that we must make both statements because * is not commutative. 


There is one difficulty that we must mention; it concerns terminology. 
It is widespread practice to refer to the operation * as matrix multiplication 
and drop the symbol *, writing AB instead of A * B. This is most unfor- 
tunate, for although * is associative, and distributive over +, just as for x 
and + in the set of real numbers, * is not commutative, and whereas + 
comes from addition of mappings, * comes not from multiplication but 
from composition. 


However, because the practice is widespread we shall refer to the operation 
as matrix multiplication for the remainder of the course, and drop the 
symbol *. 


In the next section we collect together some of the differences between 
matrices and numbers, and also some of the similarities of the two algebras. 


+ By “suitable” we mean matrices with appropriate numbers of rows and columns, so that 
the operations are defined. 
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Solution 23.2.4.3 


Exercise 1 
(2 minutes) 


23.2.6 Matrix Algebra and the Algebra of Numbers 


A major difference between matrices and numbers is that, whereas we can 
add or multiply any two numbers, this is not the case for matrices. If we 
restrict ourselves to a set of matrices with the same (fixed) number of rows 
as columns, then we can add or multiply any two matrices from the set. 
We take it as implicit, then, in this section that all the matrices have the 
same number of rows as columns, and that this number is the same for all 
the matrices, (A matrix with the same number of rows as columns is called 
a square matrix.) 


Similarities 
Numbers Matrices 
Addition is commutative, at+b=b+a A+B=B+A 
Addition is associative. (a+ b)+ce=at+(b+e) (A+B)+C=A4(B+C) 
There isu zeroclement, a+0=O0+a=a A+0=0+ A= A,where 0 
is the matrix in which every 
entry is zero. 
Each clement has an (-h)+b=0 (-B)+B=0 
additive inverse. 
Mbltiplication is associative. (abe = a(be) (AB)C = A(BC) 
There is an “identity” ad=laza Al = 1A = A, where I is the 
clement. identity matrix defined on 
page 42 
“Multiplication” is A(B + C) = AB + AC 
distributive over addition. (A + BIC = AC + BC 
Certain product rules hold, AO =0A=0 
A(-B) = —AB 
(=a)(~b) = ab (—A)(~B) = AB 


Differences 


(i) Number multiplication is commutative; matrix multiplication is not. 
That is, there exist matrices A, B such that AB # BA. 


Asa consequence, the matrix expansion(A + B)? = A? + 2AB + B? 
is false, in general. The correct expansion is (A + B)? = 
A? + AB + BA + B?. The matrix AB + BA is equal to 2AB if and 
only if A and B commute, i.e. AB = BA. This may happen in certain 
cases (for example if B = /), but it is not generally true. 


(ii) The product of two non-zero numbers is non-zero, but the product of 
two non-zero matrices may be equal to the zero matrix. 


For example, if 


1 1 1 1 
A= ] and B= | ) 
1 1 =L =, 
then AB = O. (Note also that BA # O.) This example also illustrates 


the existence of a non-zero matrix B such that B? = 0. 
(iii) The cancellation law holds for numbers, but not for matrices. 


That is, if a, b, c are numbers and a # 0, then 
(ab = ac) > (b = c) 

But for matrices, 
(AB = AC) #(B = C). 


For example, if A and B are as above, then AB = AO, but B is not 
equal to the zero matrix, i.e. A cannot be cancelled. 
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Solution 23.2.5.1 Solution 23.2.5.1 


For example, with 


(continued from page 45) 


(iv) The equation x? = 0 has the unique solution x = 0, but the matrix 
equation X? = O has an infinite number of solutions, 


It can be shown that the general solution of the matrix equation 
X? = O for matrices with two rows and columns is 


r Ss 
X = 6 for arbitrary rand non-zero s, 
Ciciae 3 
s 
or 
(a) ‘ 
X= for arbitrary t. 
t 0) 
(v) The equation x? = —1 has no solution (in real numbers), but the 
matrix equation X? = —J does have solutions. 
We single out two solutions for special attention. These are the 
matrices 
O. —1, 0 1 
C= and —C = 
1 0 1 0 


Note that the matrix —/ is equal to the rotation matrix R, correspond- 
ing to a rotation about the origin through the angle m. The matrix C 
is the rotation matrix Ry. What transformation does —C correspond 


to? 
Exercise 1 Exercise 1 
“ ’ P (2 minutes) 
Give expansions of (A — B)? and (A — B)° which are valid for all square 
matrices, (There are 8 terms in the second expansion.) | | 
Exercise 2 Exercise 2 
(2 minutes) 


The equation x? = x has precisely two solutions, x = 0 or 1. By giving an 
example of a matrix with 2 rows and 2 columns, show that the matrix 
equation X? = X has solutions other than X = O or I. | | 


23.3. APPENDIX 


The “Dimension” Theorem 

This Appendix is optional. 

THEOREM 

If L isa morphism from a vector space V to a vector space U, then 


(dimension of L(V)) + (dimension of kernel) = (dimension of V) 


PROOF 


Let (Wy, U2, M3,-++,4p} be a basis for L(V) (ie. let L(V) have dimension p), 
and let {k,,ky,ky,---,k,} be a basis for the kernel (i.e. let the kernel have 
dimension m). Since each of the u’s belongs to L(V), there must be vectors 
in V which map to them. Suppose 


cn uy 
els 
Uy Uy 


If we can show that the set of vectors B = {k,,k2,-++yhyys Uys +++ Up} form 
a basis for V, then the theorem is proved, for this will show that V has 
dimension m + p. 


First of all, we show that any vector in V can be expressed in terms of these 
vectors, and then we show that B is a linearly independent set. 


(i) Let v be any element of V; then we can find ;’s (real numbers) such that 
Uv) = My + Ona +++ + Olly 
= a L(vy) + %L(uz) +--+ + oy L{v,) 


ie, 
LU) = L(ad, + ab, +++ + app) 
ie. 
Lv) — Lav, + a2 + ++* + %b,) = Uo 
ie, 
Lv — % 0, — 2202 — 2303 — +++ = tpl) = Uo 
Thus (v — %0, — % 2, — +++ — %,) is in the kernel of L. There exist, 


therefore, numbers f,,..., f,, such that 


Byki + Baka + ++* + Byki 


U— OR) — OP, — +++ — ab, 


u = Bik, + Bak + +++ + Big + O42, + yz + +++ + ob, 


Thus any vector in V can be expressed as a linear combination of the 
vectors ky, ky..++5KmsU1+++++ Up, Le. B spans V. 
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Solution 23.2.6.1 Solution 23.2.6.1 
(4 — BP = A? — AB- BA+ B? 
(A — BP = 43 — A?B— ABA + AB? — BA? + BAB + B?A— B® 
: a 


Solution 23.2.6.2 Solution 23,2,6,2 


0 O\. 
X= 0 1 is an example. 


(continued from page 47) 


(ii) We now show that B is a linearly independent set. Consider the 
expression 


By ky + Boks +++ + Byki + 210) + Wz +o + Gpbyp =o (1) 
Then 
L(Biky + Baky + +++ + oy) = Yo 


By LAK) + BrLk2) + +++ + %LAv,) = Uo 


ot, L(v1) + XL (v2) + +++ + %pL{u,) = Uo 
because all the k’s are in the kernel and so L(k;) = uy. But 


Lv) =u, 
Lvs) = up 
Lv,) = Uy 


and the u’s have been chosen to be linearly independent. Thus, all the 
’s are zero, and expression (1) reduces to 
Byki + Baka +++ + Babin = Yo 


But the k’s are also linearly independent, and so the p's are zero. 
Thus (1) implies that all the a’s and f’s are zero, and so the set is linearly 
independent, as required, 
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